Speed up wave.resource module #352

akeeste · 2024-09-17T19:07:12Z

@ssolson This is a follow-up to my other wave PRs and resolves #331. Handling the various edge cases robustly in pure numpy is difficult, so I want to first resolve #331 by using DataArrays throughout the wave resource functions instead of Datasets.

Similar to Ryan's testing mentioned in #331, I found that using DataArrays/Pandas has a 1000x speed up vs Datasets for very large input data. This should restore MHKiT's speed to it's previous state. Using a pure numpy base would have an additional 5-10x speed up from DataArrays, but I think the current work with DataArrays will:

be sufficient for our users
be easier to develop with
be easier to handle edge cases

Before I go forward and apply this change to the rest of the wave.resource module, can you test out energy_period and frequency_moment and try to break them? With the appropriate frequency_dimension input, those functions should handle Pandas Series, Pandas DataFrames, and xarray DataArrays regardless of input shape, dimensions names, dimension order, etc.

…ay base

This reverts commit a2d5f61.

…aframes and 2+ var datasets

akeeste added 4 commits September 12, 2024 14:50

fix assignment in type_handling

fcc910e

temporary testing file

b19217c

initial conversion of energy_period and frequency_moment to DataArray

5addc17

energy_period working with variety of types and converting to dataArr…

ac91b92

…ay base

akeeste requested a review from ssolson September 17, 2024 19:07

akeeste added 13 commits September 24, 2024 14:34

extend xr.dataarray basis to all wave.resource functions

5d914a3

remove testing script

8683b88

black formatting

e860034

fix most test formatting

3382825

use dataarrays instead of datasets in wave.performance

c5241af

revert surface_elevation function back to datasets

a2d5f61

Revert "revert surface_elevation function back to datasets"

e016fff

This reverts commit a2d5f61.

allow datasets, 2d dataframes. Update test formatting appropriately

a719620

simplify and improve robustness of convert_to_dataarray for 1-var dat…

d585cb5

…aframes and 2+ var datasets

update test formatting

ac5b436

clean up frequency_bin and method checks in elevation_surface

afc7f8c

update and annotate type_handling

8f1647f

black formatting

c0d72d0

akeeste marked this pull request as ready for review October 2, 2024 18:34

akeeste marked this pull request as draft October 2, 2024 18:44

akeeste added 3 commits October 2, 2024 14:08

minor type fix

8dddf42

update type references in loads

3a170ff

update type references in loads - v2

42a85d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up wave.resource module #352

Speed up wave.resource module #352

akeeste commented Sep 17, 2024

Speed up wave.resource module #352

Are you sure you want to change the base?

Speed up wave.resource module #352

Conversation

akeeste commented Sep 17, 2024